8370041: GenShen: Filter young pointers from thread local SATB buffers when only marking old #27983

earthling-amzn · 2025-10-24T21:00:40Z

When GenShen is only marking the old generation, we do not need the SATB mechanism to preserve young pointers. We currently filter these out of the SATB buffers during the final-update-refs and init-mark safepoints. This increases latency and introduces no small amount of complexity. It should be possible to instead filter out these pointers when the SATB buffers are 'compacted' before being 'completed'.

Background

When GenShen is marking the old generation it leaves the SATB barrier enabled. When a young collection interrupts old marking, it creates a situation where a mutator thread could overwrite a field holding a pointer into a collection set region. The SATB barrier will dutifully place this object in the SATB queue. If this pointer makes it into a mark queue, the marking thread will crash. Prior to this change, GenShen filtered out such pointers after the thread local SATB buffers were completed. After this change, such pointers are filtered out before the buffers are completed. This is more inline with the natural way of things.

Progress

Change must be properly reviewed (1 review required, with at least 1 Reviewer)
Change must not contain extraneous whitespace
Commit message must refer to an issue

Issue

JDK-8370041: GenShen: Filter young pointers from thread local SATB buffers when only marking old (Enhancement - P4)

Reviewers

Kelvin Nilsen (@kdnilsen - Committer)
Y. Srinivas Ramakrishna (@ysramakrishna - Reviewer)

Reviewing

Using git

Checkout this PR locally:
$ git fetch https://git.openjdk.org/jdk.git pull/27983/head:pull/27983
$ git checkout pull/27983

Update a local copy of the PR:
$ git checkout pull/27983
$ git pull https://git.openjdk.org/jdk.git pull/27983/head

Using Skara CLI tools

Checkout this PR locally:
$ git pr checkout 27983

View PR using the GUI difftool:
$ git pr show -t 27983

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/27983.diff

Using Webrev

Link to Webrev Comment

…not being marked

Otherwise, the filter won't filter out young (possibly garbage) pointers. I don't think this needs to happen on a safepoint. We could probably even do it as part of pausing old GC.

…d pointers

This should be the last and only time it is necessary

…n-update-roots

bridgekeeper · 2025-10-24T21:01:35Z

👋 Welcome back wkemper! A progress list of the required criteria for merging this PR into master will be added to the body of your pull request. There are additional pull request commands available for use with this pull request.

openjdk · 2025-10-24T21:02:48Z

@earthling-amzn This change now passes all automated pre-integration checks.

ℹ️ This project also has non-automated pre-integration requirements. Please see the file CONTRIBUTING.md for details.

After integration, the commit message for the final commit will be:

8370041: GenShen: Filter young pointers from thread local SATB buffers when only marking old

Reviewed-by: kdnilsen, ysr

You can use pull request commands such as /summary, /contributor and /issue to adjust it as needed.

At the time when this comment was updated there had been 51 new commits pushed to the master branch:

d5831ed: 8357880: Code formatting typo in Cipher.getMaxAllowedParameterSpec
1357be9: 8371178: Preserve fast version of getfield and putfield in AOTCache
acc8a76: 8357034: GifImageDecoder can produce wrong transparent pixels
... and 48 more: https://git.openjdk.org/jdk/compare/1922c4fd6f10e6eac121462d509d6990ae4f9acd...master

As there are no conflicts, your changes will automatically be rebased on top of these commits when integrating. If you prefer to avoid this automatic rebasing, please check the documentation for the /integrate command for further details.

➡️ To integrate this PR with the above commit message to the master branch, type /integrate in a new comment.

openjdk · 2025-10-24T21:03:27Z

@earthling-amzn The following labels will be automatically applied to this pull request:

hotspot-gc
shenandoah

When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing lists. If you would like to change these labels, use the /label pull request command.

mlbridge · 2025-10-24T21:06:13Z

Webrevs

kdnilsen

This is a great simplification. Do we have any performance numbers, especially for reduction in p99.999 and p100 latencies with certain Extremem workloads, which I believe to be related to safepoint flushing of satb buffers?

kdnilsen · 2025-10-24T21:16:10Z

src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp

+  //     be in the collection set. If this happens, the pointer will be preserved, essentially
+  //     becoming part of the old snapshot.
+  //  2. The region is allocated during evacuation of old. This is also not a concern because
+  //     we haven't yet finished marking old so no mixed evacuations will happen.


Might be worth mentioning that there may be some additional analysis and code required when we use the forwarding table to recycle cset regions during evacuation and/or updating. If one of these regions becomes old before the SATB buffers have been flushed, then a young cset pointer that lingers in a SATB buffer will "all of a sudden" look like a valid old pointer and will not be purged from the SATB buffer. When we subsequently scan the "object" referenced by this obsolete pointer, we are likely to find "garbage memory", possibly resulting in a crash.

I am thinking that an initial fix might be to do this flushing at init-update-refs instead of at final update refs, and to not recycle cset regions until evacuation is all done. Is there a different handshake there that we might piggyback on?

Hmm, that's a good point about recycling regions concurrently. I don't think we can flush before init-update-refs because forwarded pointers still exist and the SATB barrier doesn't try to resolve them. That is to say, "bad" pointers can be created throughout the update-refs phase.

Even in the (hypothetical) scenario with a forwarding table, a region would only become old through an in-place-promotion (in which case, it will never have been recycled) or we will be doing a mixed evacuation. If we are running a mixed evacuation, we will already have finished marking old.

earthling-amzn · 2025-10-24T22:17:21Z

I have results from an earlier version of this PR that flushed the buffers during init-mark. That version showed a consistent 3% improvement on critical and max jops. They also showed a reduction in p99.999 latency for some of the extremem measurements. I will retest with this version.

earthling-amzn · 2025-10-28T16:09:58Z

I found an issue with the verifier that needs to be fixed before this PR is integrated.

…s in progress This has to happen at least once during the degenerated cycle. Doing it at the start, rather than the end, simplifies the verifier.

…n-update-roots

earthling-amzn · 2025-11-04T22:25:00Z

I've run more tests and confirmed that critical and max jops are 3% improved on a variety of heap sizes and configurations. Additionally, after running more tests with extremem, the apparent regression at p100 has evaporated:

genshen/extremem/control                                                                                                                                                                                                  
                                Category |  Count |         Total |      GeoMean |      Average |     Trim 0.1 |       StdDev |      Minimum |      Maximum                                                               
                  sales_transaction_p100 |     23 |     38817.000 |     1685.651 |     1687.696 |     1690.158 |       84.299 |     1539.000 |     1823.000                                                               
                   browsing_history_p100 |     23 |     32145.000 |     1391.269 |     1397.609 |     1382.211 |      139.779 |     1175.000 |     1769.000                                                               
               customer_replacement_p100 |     23 |    107141.000 |     4652.940 |     4658.304 |     4664.211 |      227.279 |     4093.000 |     5053.000                                                               
                product_replacement_p100 |     23 |     58315.000 |     2526.181 |     2535.435 |     2523.737 |      223.834 |     2203.000 |     3041.000                                                               
               customer_preparation_p100 |     23 |    142953.000 |     5442.520 |     6215.348 |     6064.000 |     3132.333 |     2502.000 |    11547.000                                                               
                  customer_purchase_p100 |     23 |    491201.000 |    18622.792 |    21356.565 |    20385.263 |    11424.057 |     6582.000 |    48274.000                                                               
            customer_save_for_later_p100 |     23 |    880105.000 |    36675.774 |    38265.435 |    37248.895 |    11777.757 |    23023.000 |    65591.000                                                               
               customer_abandonment_p100 |     23 |    701197.000 |    28578.563 |    30486.826 |    29311.105 |    11521.080 |    16390.000 |    59179.000                                                               
genshen/extremem/experiment                                                                                                                                                                                               
                                Category |  Count |         Total |      GeoMean |      Average |     Trim 0.1 |       StdDev |      Minimum |      Maximum                                                               
                  sales_transaction_p100 |     23 |     40007.000 |     1730.529 |     1739.435 |     1711.947 |      193.896 |     1528.000 |     2490.000                                                               
                   browsing_history_p100 |     23 |     33032.000 |     1425.803 |     1436.174 |     1436.316 |      175.753 |     1123.000 |     1729.000                                                               
               customer_replacement_p100 |     23 |    107516.000 |     4653.546 |     4674.609 |     4599.579 |      488.862 |     4072.000 |     6579.000                                                               
                product_replacement_p100 |     23 |     56647.000 |     2451.855 |     2462.913 |     2469.789 |      235.896 |     1968.000 |     2903.000
               customer_preparation_p100 |     23 |    136482.000 |     5224.974 |     5934.000 |     5766.684 |     3027.151 |     2924.000 |    10652.000
                  customer_purchase_p100 |     23 |    464888.000 |    17675.074 |    20212.522 |    18641.263 |    11355.233 |     7921.000 |    55420.000
            customer_save_for_later_p100 |     23 |    854932.000 |    35744.969 |    37170.957 |    35660.579 |    11323.562 |    24370.000 |    71376.000
               customer_abandonment_p100 |     23 |    686923.000 |    28261.963 |    29866.217 |    28589.737 |    10907.943 |    17791.000 |    65329.000

Indeed, the experiment looks slightly better in some cases (slightly worse in others). The results also show the expected reduction in safepoint times as we are no longer flushing SATB buffers during final_update_refs or init_mark:

-133.33% extremem/shenandoahfinalupdaterefs_stopped_max p=0.00000 (Welch's T-Test)
  Control:      1.103   (+/-  0.14  )         23
  Test:         0.473   (+/-  0.06  )         23

The effect is more pronounced on specjbb2015:

-216.54% specjbb2015/shenandoahfinalupdaterefs_stopped_max p=0.00000 (Mann-Whitney)                                                                                                                                                 
  Control:      2.217   (+/-  0.09  )         22                                                                                                                                                                          
  Test:         0.700   (+/-  0.19  )         22 

-791.43% specjbb2015/shenandoahinitmark_stopped_max p=0.00173 (Mann-Whitney)                                                                                                                                                        
  Control:      3.408   (+/-  3.12  )         22                                                                                                                                                                          
  Test:         0.382   (+/-  0.07  )         22

kdnilsen

Thank you for bringing this to closure...

ysramakrishna

Changes look ok. I found some of the comments confusing -- I have left some remarks at those places, please have a look to see if they can be made clearer.

The improvement in performance looks good. Do we track the number of SATB pointers processed by the old marking (to compare between before and after your changes here)?

ysramakrishna · 2025-11-06T01:04:04Z

src/hotspot/share/gc/shenandoah/shenandoahConcurrentGC.cpp

+  //  1. The region is promoted in place. This is safe because such regions will never
+  //     be in the collection set. If this happens, the pointer will be preserved, essentially
+  //     becoming part of the old snapshot.
+  //  2. The region is allocated during evacuation of old. This is also not a concern because


One related question. In both these cases, I assume the reference will look "marked" because it's above TAMS for the purposes of the old marking?

ysramakrishna · 2025-11-06T01:16:42Z

src/hotspot/share/gc/shenandoah/shenandoahOldGeneration.hpp

  // We leave the SATB barrier on for the entirety of the old generation
  // marking phase. In some cases, this can cause a write to a perfectly
  // reachable oop to enqueue a pointer that later becomes garbage (because
  // it points at an object that is later chosen for the collection set). There are


// ... In some cases, this can cause a write to a perfectly // reachable oop to enqueue a pointer that later becomes garbage (because // it points at an object that is later chosen for the collection set).

I don't understand this statement. The SATB is supposed to be pointers to objects that we will preserve because they were reachable when the snapshot (marking) was started. Can you elaborate what you mean here? Did you mean that the filtering of the SATB didn't filter a (sometime) young reference which was then processed by the old marking?

ysramakrishna · 2025-11-06T01:18:52Z

src/hotspot/share/gc/shenandoah/shenandoahOldGeneration.hpp

  // also cases where the referent of a weak reference ends up in the SATB
  // and is later collected. In these cases the oop in the SATB buffer becomes
  // invalid and the _next_ cycle will crash during its marking phase. To


Again I don't understand the concept of an SATB pointer to an object that was later collected? Are we talking about young objects that are subsequently processed by old marking because they weren't filtered out when they should be?

I think that is probably the case here, but it would be good to clean up these comments to avoid this confusion.

earthling-amzn added 18 commits October 16, 2025 15:27

Filter out young pointers from thread local SATB buffers if young is …

8493d54

…not being marked

Temporarily increase log level

ac36718

Remove instrumentation

bdbd8e3

Comment out all calls to transfer old pointers out of SATB

039a08d

Need to flush thread local SATB buffers before young marking

bbf207c

Otherwise, the filter won't filter out young (possibly garbage) pointers. I don't think this needs to happen on a safepoint. We could probably even do it as part of pausing old GC.

Avoid checking gc-state every time we filter a SATB buffer

1301df4

Stop processing completed SATB buffers, they should no longer have ba…

eb65003

…d pointers

Manage satb filter at safepoint exit

05aa120

Remove unused code

004a954

Flush old satb after update references

03a4dad

This should be the last and only time it is necessary

Try piggybacking satb flush on update roots

e4f4f8a

Oops, move inline definition out of ifdef ASSERT

584b532

Fix assertion

bafd55d

Merge remote-tracking branch 'jdk/master' into piggyback-satb-flush-o…

feeeaaf

…n-update-roots

Cleanup and comments

f47e976

Only flush satb once during degenerated cycle

72d89d4

Remove duplicate satb flush closure

48d4941

Fix typo in comment

ec58b72

openjdk bot added hotspot-gc hotspot-gc-dev@openjdk.org shenandoah shenandoah-dev@openjdk.org labels Oct 24, 2025

openjdk bot added the rfr Pull request is ready for review label Oct 24, 2025

kdnilsen reviewed Oct 24, 2025

View reviewed changes

earthling-amzn marked this pull request as draft October 28, 2025 16:09

openjdk bot removed the rfr Pull request is ready for review label Oct 28, 2025

earthling-amzn added 2 commits October 28, 2025 13:29

Flush SATB buffers upon entering degenerated cycle when old marking i…

ef21a81

…s in progress This has to happen at least once during the degenerated cycle. Doing it at the start, rather than the end, simplifies the verifier.

Merge remote-tracking branch 'jdk/master' into piggyback-satb-flush-o…

01f0f97

…n-update-roots

earthling-amzn marked this pull request as ready for review October 30, 2025 23:59

openjdk bot added the rfr Pull request is ready for review label Oct 31, 2025

Merge remote-tracking branch 'jdk/master' into piggyback-satb-flush-o…

4bd602d

…n-update-roots

kdnilsen approved these changes Nov 5, 2025

View reviewed changes

ysramakrishna approved these changes Nov 6, 2025

View reviewed changes

openjdk bot added the ready Pull request is ready to be integrated label Nov 6, 2025

8370041: GenShen: Filter young pointers from thread local SATB buffers when only marking old #27983

Are you sure you want to change the base?

8370041: GenShen: Filter young pointers from thread local SATB buffers when only marking old #27983

Conversation

earthling-amzn commented Oct 24, 2025 • edited by openjdk bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Background

Progress

Issue

Reviewers

Reviewing

Uh oh!

bridgekeeper bot commented Oct 24, 2025

Uh oh!

openjdk bot commented Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

openjdk bot commented Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mlbridge bot commented Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Webrevs

Uh oh!

kdnilsen left a comment

Choose a reason for hiding this comment

Uh oh!

kdnilsen Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

earthling-amzn Oct 24, 2025

Choose a reason for hiding this comment

Uh oh!

earthling-amzn commented Oct 24, 2025

Uh oh!

earthling-amzn commented Oct 28, 2025

Uh oh!

earthling-amzn commented Nov 4, 2025

Uh oh!

kdnilsen left a comment

Choose a reason for hiding this comment

Uh oh!

ysramakrishna left a comment

Choose a reason for hiding this comment

Uh oh!

ysramakrishna Nov 6, 2025

Choose a reason for hiding this comment

Uh oh!

ysramakrishna Nov 6, 2025

Choose a reason for hiding this comment

Uh oh!

ysramakrishna Nov 6, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

3 participants

earthling-amzn commented Oct 24, 2025 •

edited by openjdk bot

Loading

openjdk bot commented Oct 24, 2025 •

edited

Loading

openjdk bot commented Oct 24, 2025 •

edited

Loading

mlbridge bot commented Oct 24, 2025 •

edited

Loading

kdnilsen Oct 24, 2025 •

edited

Loading